CDS

Accession Number TCMCG075C25961
gbkey CDS
Protein Id XP_007012889.2
Location 4788061..4789518
Gene LOC18588428
GeneID 18588428
Organism Theobroma cacao

Protein

Length 485aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007012827.2
Definition PREDICTED: pentatricopeptide repeat-containing protein At2g15690 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description Pentatricopeptide repeat-containing protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0000096        [VIEW IN EMBL-EBI]
GO:0000097        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005829        [VIEW IN EMBL-EBI]
GO:0006082        [VIEW IN EMBL-EBI]
GO:0006520        [VIEW IN EMBL-EBI]
GO:0006534        [VIEW IN EMBL-EBI]
GO:0006790        [VIEW IN EMBL-EBI]
GO:0006807        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008152        [VIEW IN EMBL-EBI]
GO:0008652        [VIEW IN EMBL-EBI]
GO:0009058        [VIEW IN EMBL-EBI]
GO:0009069        [VIEW IN EMBL-EBI]
GO:0009070        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0016053        [VIEW IN EMBL-EBI]
GO:0019344        [VIEW IN EMBL-EBI]
GO:0019752        [VIEW IN EMBL-EBI]
GO:0043436        [VIEW IN EMBL-EBI]
GO:0044237        [VIEW IN EMBL-EBI]
GO:0044238        [VIEW IN EMBL-EBI]
GO:0044249        [VIEW IN EMBL-EBI]
GO:0044272        [VIEW IN EMBL-EBI]
GO:0044281        [VIEW IN EMBL-EBI]
GO:0044283        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0046394        [VIEW IN EMBL-EBI]
GO:0071704        [VIEW IN EMBL-EBI]
GO:1901564        [VIEW IN EMBL-EBI]
GO:1901566        [VIEW IN EMBL-EBI]
GO:1901576        [VIEW IN EMBL-EBI]
GO:1901605        [VIEW IN EMBL-EBI]
GO:1901607        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGCGTCTCTGGTGTCCCTTCAAAACACGGGAAATTCCATTAGCTTCTCTTCCCACTTAAAATCTTCCATCAATGCCAAGCACTCTCTCGCCTTCAACGAGAAAATCAAGCTCTTTTTTTCTCAAAACCCCAACAAACTCAAGCCTCCATGCACCTACGCGGCATCCCATAACACTGATAGCAACAGCTCTACCAGGACTTACCGCCGCCAAAGTGTCACAACTCACCGCCAAACCACGAAAAATCGGCCGAACCCTCGTAAATTTAACACTGAAAACCGAAACGAAAACCACCCATCTCCTGAAAATCCAACATTTCAAAGTATTACCGTGGATTTGATGAAGTTATGCAAAGAAGGCAAGGTTAAGGAAGCTTTAGATTATATGGGTCAAGGTGTTTTAGCGGATTTTAATGTTTTCGGGGCGTTACTGGATGCTTGCGGGAATATGAATTCGCTAGAATTAGCCAGAAGAGTTAATGAATTTTTTAGAAGATCAAAGTTTTCTGGTGATATCGAATTGAACAACAAATTGATAGATATTTATGGGAAATGCGCTAGTATAAGAGATGCTCGCAGGGTGTTCGATAAAATGCGTGAGCGAAATATGGCTTCTTGGAATTTGATGATAAATGACTATGCGGTGAATGGGAAAGGAGATGATGGATTGTTGTTGTTTGAGGATATGAGAAAAGATGGGTTTCAGCCGGATAGCGAAACTTTTCTGGCGGTTCTATCGGCTTGTGCCACTGTGGCGGCTGTGGAGGAAGGAATAATGTACTTTGAATTGATGAAGAATGAATACAGGATTGCTCCGGGAGTTGAACATTATTTAGGAGTGATTGATGTTTTTGGGAGAGCTGGGTATTTGAATGAGGCTGTGGAGTTTATTGAGAATATGCCTATTGAGCCAACGGTGGAGATTTGGGAGGCAATCAGGGGTTTTGCACGAATTCATGGAGATATTGACCTTGAGGATCATTTTGAGGAGTTGTTGCTTGGATTTGATCCTCCTATGAGAAGTGAGAATGAACACCAAGCACCACCAAGGAAGAAGCATTCTGTGATTAACATGATTGAGGAGAAGAATAGGGTGATTGAGTATCGGTGTATGAACCCTTTCAAGGGAGAAGTAAATGAGAAGCTGAAAGGTTTGAATGGGCAGATGAGGGAAGCAGGGTATGTGCCTGATACAAGGTATGTGCTTCATGATATTGATCAGGAGGCAAAAGAGCAGGCCTTGCAATACCATAGTGAGCGTTTGGCAATTGCTTATGGTCTTATTAGCACTCCGGCAAGGACACCTCTTAGGATCATTAAGAACCTGAGAATCTGCGGCGACTGCCACAATGCAATAAAAATCATGTCCAAGATTGTTGGGAGAGAGTTGATTGTGAGGGATAACAAGCGCTTTCATCACTTCAGGGATGGCAAATGCTCGTGTGGTGATTACTGGTAA
Protein:  
MASLVSLQNTGNSISFSSHLKSSINAKHSLAFNEKIKLFFSQNPNKLKPPCTYAASHNTDSNSSTRTYRRQSVTTHRQTTKNRPNPRKFNTENRNENHPSPENPTFQSITVDLMKLCKEGKVKEALDYMGQGVLADFNVFGALLDACGNMNSLELARRVNEFFRRSKFSGDIELNNKLIDIYGKCASIRDARRVFDKMRERNMASWNLMINDYAVNGKGDDGLLLFEDMRKDGFQPDSETFLAVLSACATVAAVEEGIMYFELMKNEYRIAPGVEHYLGVIDVFGRAGYLNEAVEFIENMPIEPTVEIWEAIRGFARIHGDIDLEDHFEELLLGFDPPMRSENEHQAPPRKKHSVINMIEEKNRVIEYRCMNPFKGEVNEKLKGLNGQMREAGYVPDTRYVLHDIDQEAKEQALQYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSKIVGRELIVRDNKRFHHFRDGKCSCGDYW